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5 P „„ -lo 6 h r T m in ; ent f ion ^ e,ates t0 computerised graphic systems, and in particular, discloses a graphics 
engine which provides for the manipulation of image data'. grapmcs 

[Prior Art] 

10 Desktop publishing systems are known in the art which enable a user to manipulate visual image data so 

as to create or amend a visual image in some manner. 9 ° 

Known systems generally have the capability of creating black and white images which can be outputted 
on either a monochrome display, or by use of a black and white laser printer outputted 

« H0W6Ver ' there e **f te a demand for deskt °P Publishing systems that operate in full colour and provide for 

is performance comparable to existing monochrome systems both as regards speed and function on the o^e hand 
and ,mage quality on the other. However, for colour systems, the amount of data required to be S^asS b 

2££ and white system - According,y - ^ — ^"--.^S 

20 SUMMARY OF THE INVENTION 

It is an object of the present invention to substantially overcome, or ameliorate the abovementioned dif- 
imaoe da 3 , Til" ' deViCe ,' hat " " * ^ ° f ^ hics ^ZTo"coZ 

« rS a n rco.^^^ 

in accordance with one aspect of the present invention there is disclosed a graphics engine to manipulate 
colour video image data on a pixel-by-pixel basis, said engine comprising a parallel data path for^ Tone Z 

TsLTZZT^ ,f T 39e dat3, CaCh Path C ° mPriSin9 3 —Connected Inte-po'tor and 
po.it or and sa,d data paths be,ng connected in parallel with a cascade connected Iransparency interpolator 

HT ? ner ' Wh , erein 6aCh °' Said C °' OUr inter P olators ~»<v colour variation commandsf each o^sa j 
compositors receive colour ,nput data for each said pixel in turn and said compositors transform satf cotour 
input data ,n accordance with a pre-determined function the output of which is dependent uponToth hetpu 
colour data and the interpolator variation command, and wherein said transparency interpolator ^eceiveVfrans 
parency vanat.cn commands, and said matte combiner receives matte plane input'data for each safd pixeTand 
35 sad combiner produces a colour blend value and an output, matte value for each said pixel by trans omZ 
sa,d input matte data in accordance with said transparency variation commands, the output o/saTd g 7h 2 
engine being colour image data simultaneously manipulated as colour, transparency and matte 

m e C a 0 ns a for^ ith """T'E? of ,he P resent inve "«°" »«™ * disclosed a ramp generator comprising: 
« „„„ , m " nsforston "9 a n«m*«r<N) as a plurality of bits of information; an accumulatorfor generating a ramp 
to output between a predetermined minimum and a predetermined maximum- and 

means for testing the most significant of said bits to establish either a step mode or a jump mode of oper- 
ation, whereby ,„ said step mode, said output increases at a rate less than 1:1 and in sa dTump mode said 
outpu increases at a rate equal to or in excess of 1:1. the value of said number determining thecal UmJ taken 
for said output to traverse between said minimum and said maximum. 

BRIEF DESCRIPTION OF THE DRAWINGS 

which: Prefe,red emb ° diment ° f the present inven,ion "« "e described with reference to the drawings in 

gShi^"Ing"e; SChe,natiC b, ° Ck dia9ramS ° f "** ^ mani ' 3ulation svste ™ "hich incorporate a 
Fig. 2 is a schematic block diagram of the preferred embodiment of the graphics engine- 
Fig. 3 is a schematic block diagram of the render processor interface of Fig 2 
Fig. 4 is a schematic block diagram of the first-in-first-out register of Fig 3- 
Fig. 5 is a flow chart showing various states of the read address generator of Fig 4" 
r ig. 6 is a schematic block diagram of the control unit of Fig. 2; ' 
Fig. 7 is a schematic block diagram of the run controller of Fig 2- 
Fig. 8 is a schematic block diagram of the run length register of Fig 7- 



50 



BNSOOCID <EP O*652S0A2J^> 



EP 0 465 250 A2 



Fig. 9 is a schematic block diagram of the run length counter of Fig. 

Fig. 10 is a schematic block diagram of the ramp generator of Fig. 7; Figs. 11 A and B illustrate the gener- 
ation of the ramp of the generator of Fig. 10; 

Fig. 12 is a schematic block diagram of one of the colour interpolators of Fig. 2; 
5 Fig. 13 is a schematic block diagram of the interpolator A of Fig. 12; 

Fig. 14 is a schematic block diagram of the interpolator B of Fig. 12; 

Fig. 15 is a schematic block diagram of one of the colour compositors of Fig. 2; 

Fig. 16 is a schematic block diagram of compositor A of Fig. 15; 

Fig. 17 is a schematic block diagram of the compositor B of Fig. 15; 
10 Fig. 18 is a schematic block diagram of the transparency interpolator of Fig. 2; 

Fig. 19 is a schematic block diagram of the transparency interpolator synchroniser of Fig. 18; 

Fig. 20 is a schematic block diagram of the transparency interpolator A of Fig. 18; 

Fig. 21 is a schematic block diagram of the transparency interpolator B of Fig. 18; 

Fig. 22 is a schematic block diagram of the matte combiner of Fig. 2; 
15 Fig. 23 is a schematic block diagram of the matte synchroniser of Fig. 22; 

Fig. 24 is a schematic block diagram of the matte delay of Fig. 22; and 

Fig. 25 is a schematic block diagram of the matte calculator of Fig. 22. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

20 

As seen in Fig. 1A, a simple form of image manipulation system takes the form of a general purpose com- 
puter 1 , a graphics engine 1 0, a work screen memory 30 of typically 3 Mbytes maximum and a screen or monitor 
3 capable of high resolution. In this system images stored in the computer 1 are able to be displayed on the 
monitor 3 and manipulated via the graphics engine 10 and the keyboard (not illustrated) associated with the 
25 monitor 3. 

Another type of image manipulation system shown in Fig. 1 B is associated with a colour laser copier 5 which 
is preferably the CANON CLC500 COLOR LASER COPIER (Registered Trade Mark). The copier 5 includes a 
scanner 6 and printer 7 which are connected to a page store or memory 4. The page memory 4 is typically 96 
Mbytes maximum for an entire A3 size 400 dots per inch (dpi) image. The data rate of this copier is 13.35 MHz 

30 which is substantially the same as a standard video broadcast data rate of 1 3.5 M pixels/second. The computer 
1 and graphics engine 10 are substantially as before. 

As indicated in Fig. 1C, it is possible to combine both the above systems by the provision of a data bus 26 
interconnecting the graphics engine 10, the work screen memory 30 and page store 4. With this configuration, 
a image read by the scanner 6 can be manipulated whilst displayed on the monitor 3 and then printed out using 

35 the printer 7. 

Fig. 1D illustrates a similar system save that a pan/zoom engine 9 is provided between the data bus 26 
. and the work screen store 30 to further increase the nature of the image data manipulations able to be perfor- 
med. 

The system of Fig. 1E is concerned essentially with video images which can be either still video or lines 
40 of a live video for broadcast or other purposes. Here the graphics engine 10 provides an output (only) to a line 

store 40 which operates on the first in first out (FIFO) principle and which in turn outputs to a triple digital to 

analogue converter (DAC) 41 to provide the necessary output video signal 

Lastly, as seen in Fig. IF, a inter-active video image manipulation system incorporates the computer 1 and 

graphics engine 10 as before, however, the graphics engine 10 both outputs data to, and receives data from, 
45 a double bufferred frame store 42. The frame store is able to receive video data from a video source 43 and 

output video data to a video destination 44. 

With the above applications in mind, the preferred embodiment of the graphics engine 10 of the present 

invention will now be described with reference to Fig. 2 and with general reference to an image manipulation 

system of the general type depicted in Fig. 1D. 
so It will be seen in Fig. 2 that the graphics engine 10 includes an input data bus 11, which provides a video 

input, and an output video data bus 19. Also included within the graphics engine 10 is a render processor (RP) 

interface 1 00, a run length controller 200, and a control unit 300. A data path includes interpolators 400 for each 

of red, green, and blue, a transparency interpolator 600 as well as compositors 500 for each of red, green, and 

blue and a matte combiner 700. An address generator 800 and a write controller 900 are also provided. 
55 Commands enter the graphics engine 1 0 via the RP interface 100 and both the bus 12 which provides RP 

data and the bus 13 which provides RP control. Signal lines 14 provide for direct memory access (DMA) control. 

The control unit 300 accepts two clock signals 15 which include a pixel clock running typically at 13.35 MHz. 

The control unit 300 reads data from the RP interface 100, interprets those commands received, and passes 
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the command operands to the run controller 200, and to the data oath inri„ri,w. w*. , . 
operands are also passed to the address generator 8M a„?Cl* IS 

also responsible for decoding microcode from these command* JZnhu . k ° ntrol unit 300 is 

execution devices within the engine 1 0. commands, which is subsequently used by respective 

s The run controller 200 is initialised with the run length of the oarticlar n^h v 

unit 300 and produces a lineariy increasing number in Sfa™ of « ™ 7 ^ t command bv control 
run. This ramp is used by the interpolate to conVol ^.he mix nalroooZ dUrat ''° n ° f the 

Thegraphics engine 10 is configured to operat i UDO^red Zin n h commands - 
is an industry standard. The colour data pa?h o red Tareen Z Z, ' ? ^ (RGBM) inf °™ a «°n which 
'0 a blend command, is initialised by the control un ?300 w«h TttXZT**?"* ^ 96neral case ° f 
path produces a blend between tLe two co ours and^n com l^ , end ^ The «*«" data 

the current colour, in the pixel memory (wort sc^e store 30 o™1f *• J^Undudlng a percentage of 
transparency data path includes a ta^U-m^^^S! * » 4) T"'* RGBM ° ut bus 1 9 - Tha 
of a transparency blend command, the LnsZltXZT^ZZ ^V^ Smf> ' e ^ 
15 transparency and an end transparency The fransoarencvTa I nl 2 „ * C ° ntrol 300 witn a sl *« 
va.ues. and the result is used tc Control th ^ composTo Int P T™"* * ^ betWeen ,hese ^ 
rency blenc I can a,so be written or -n^S^T^ ^ ^ 

-d^sses^^^^ 

20 page memory 4. which is interconnected to the buses 1 and 19^^^ °' a "° Win9 feadS fr ° m ,he 
the work screen memory 30. and visa versa. ' ° CCUr parallel wi,n wit ^ '° 

The write controller 900 provides synchronised read and write sicnal* tn ih» „• > 
tosynchronise tothe pan/ Z oom engine 9, the w*^^^,*^,^' 4 ' 3 °" ,n 
in all execution units of the engine 10 when wailino fnr™ m „. ,. P , aSTALLsl 9 naltos "spend operation 

« memory 30. The write controller 900 aiso piZe ofslf °\ ""if aCCeSS l ° me WOfk SCTee " 

used by other execution units. n* 1 "*'"* a P'pelme of status signals which contains the micro-code 

cree^ue^ — component colours, red. 

positor 500. The interpolators 40 are ^rlsponsfb.^ for TnerZ, T tT^" 3 *" inte -Polator400 and a com- 
o are responsible for mLg the biend w^ 'he p^l ImoT ' C0 ' 0UrS ' "« *° ™ 

posiST^ 

compositors 500 with the mixing proportion wntcha selecte tam' „» T T °° Pr ° V '' deS the c0,our 

The matte output is also fed tome red ^.^^.^3^2 « let ™ s P a ™cy -urces. 
» to the matte plane, to be written to the red planefcstead Wf " Ch W ° U ' d haVe been writte " 

wi„ ^Z^ZZZSZ?"' 9raPhiCS 1 °' *° «*- -,-s components thereof 

pJ^^^^^j^ p k :;,rc p o:rn°d mmand r the — -* — « - 

- command is complete, the graphics instruct* in i tL wT a "sT^tfonarTh' 0 n'f ' PreV ''° US 
the address generator 800 proceed in oarallel anH »mH Z. r . • . 9 6 data path 40 °/500 and 

the pixel clock PXCLK 15 A nev» pixel is thirlft „ " * mW ™ Um ° f Six Cl0ck periods °" 

unless stalledbyaperipner^^^^^^ 

150 (Fig. 10 to be described hereafter) within the run contZ « 9 , ,nstnjction - a ram P generator 

0 to 255 and indicates to the controller Z ^U^^Z^TT^^ '' nCreaS ' m9 ™<» b "«°<» 
controller 900 interfaces to the pixel memory 4 3 rJ Z < ? mple ' ed v,a a signal. The write 

also interfaces to a further bus 20 s3yZ Z ^ via the bus 2 1 ■ The write controller 900 
track of when and where to read and wn'te pixels 9,06 ^ WOrkscreen ™™V 30. and keeps 

manSTonfigu^n tZlT^tl^:^ ^ 1 °" TheSe are; ««»P-«"9 com- 

mon type of instruction ^t^SZ^X^TT^ C ° mp0Sit,n9 commands •» •» most corn- 
operations in the page store 4 and onTe s£«n w- * 1 1 '° Perf0ml 00tour linear Wendin 9 
used to set up internal or external hard^relnnecZ ^ SCreen ^ 3a Confi 9™«° n commands are 
sation commands are used to £ ^Sraph^ enl^ 1 n t0 h eXeCutin 9 compositing commands. Synchrony 
hardware events. Table 1 shows the three rvntfo? ! s y nchronlsed to intema ' hardware events or external 

The source and destination o btenZ. ™- ' nStrUC "° nS a " d their respectl ' ve instructfon codes, 
the workscreen 3 (WS). C^gj the e' ^S?^" eUher the C ° mp ° Si,in9 pa ° e 4 (CM, or 

compositing memory 4; compositin! ^ wi hfnthe l^ P """"i 58 '"^ commands: compositing within the 

mpositing within the workscreen memory 30; compositing from the compositing mem- 
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ory 4 to the workscreen memory 30; and compositing from the workscreen memory 30 to the compositing mem- 
ory 4. During CM to CM compositing, one pixel is read and written in every pixel clock period (75 ns). Reads 
are performed in the first half of the pixel dock periods and writes in the second half. The graphics engine 10 
uses a i pipeline delay system in which data is transferred sequentially between various devices within the 
5 engine 10. Accordingly, there is a delay between reading the "old" value of a pixel, and writing the "new" value 
of the pixel. For this reason, the compositing memory 4 address changes between the first and second halves 
of the pixel clock period. 

Due to timing constraints in the compositing memory 4, there is one pixel clock period delay between issuing 
a read address, and reading the data at that address. This delay is compensated for by adjusting the position 
10 within the graphics engine 10 pipeline where the read data is received. The compositing memory 4 also has a 
one clock delay when writing pixel data, but this does not affect the internal operation of the graphics engine 
10. 

During WS to WS compositing, a maximum of one pixel is read or written in every pixel clock period (75 
ns). Reads are requested in one pixel clock period, and writes in another. Because of pipeline delays in the 

15 graphics engine 10, for the same reasons as described above, the workscreen memory 30 address changes 
between successive read and write requests. 

Due to timing constraints in the workscreen memory 30, there can be a delay between issuing a read 
request, and reading the data at that address. Handshake signals from the pan/zoom engine 9 indicate when 
the data is ready. This delay is compensated for by stalling the graphics engine 10 pipeline until the read data 

20 is received. The graphics engine 10 reads the data from the monitor or workscreen 3 in the clock period after 
the read acknowledge handshake signal is received from the pan/zoom engine 98e 8, in order to be compatible 
with the one clock delay in the compositing memory 4. Latches, (not illustrated and external to the graphics 
engine as earlier described), hold the data during this delay. The workscreen memory 30 can also have a delay 
when writing pixel data. Typically, the first pixel written will not be delayed, as the write request is "posted", 

25 and performed in subsequent clock periods. The pan/zoom ratio must be set to 1 :1 for WS to WS compositing, 
as will be understood by those skilled in the art. 

CM to WS compositing is used to transfer pixel data from the compositing memory 4 to the workscreen 
memory 30. During CM to WS compositing, a maximum of one pixel is read and written in every pixel clock 
period. Reads are requested from the compositing memory 4 in the first half of the pixel clock period. Writes 

30 are requested from the workscreen memory 30 during the entire pixel clock period, but the address and data 
are only available in the second half of the pixel clock period. Separate addresses are maintained for the work- 
screen address and the compositing memory 4 address. 

Due to timing constraints in the compositing memory 3, there is a one pixel clock period delay between 
issuing a read address, and reading the data at that address. This delay is compensated for by adjusting the 

35 position within the graphics engine 10 pipeline where the read data is received. Due to timing constraints in 
the workscreen memory 30, there can be a delay between issuing the write request, and receiving the corre- 
sponding handshake signal from the pan/zoom engine 9 which indicates when the data has been accepted. 
This delay is compensated for by stalling the graphics engine pipeline until the handshake is received. The write 
address and data will be presented again in the second half of the next clock period until the handshake is 

40 received. Typically, the first pixel written will not be delayed, as the write request is posted, and performed in 
subsequent clock periods. 

CM to WS compositing can be used in conjunction with a reduction zoom ratio to reduce the size of the 
workscreen image. In a reduction mode, the pan/zoom engine 9 discards pixels from the graphics engine 10, 
and only writes every "NTH" pixel. This is transparent to the operation of the graphics engine 10. as the only 

45 difference it perceives is that writes appear to be completed quicker. 

CM to WS compositing can also be used in conjunction with a magnification zoom ratio to increase the 
size of the workscreen image. In expansion mode, the pan/zoom engine 9 reads a pixel from the graphics engine 
10. and writes the next "N x N" pixels with the same value. This is also transparent to the operation of the 
graphics engine 10, as the only perceived difference is that writes appear to take longer to complete. Note that 

so only the first pixel address in a run is used by the pan/zoom engine 9. The graphics engine 1 0 generates addres- 
ses for subsequent pixels, but those addresses are not required by the current implementation of the pan/zoom 
engine 9. 

WS to CM compositing is used to transfer pixel data from the workscreen memory 30 to the compositing 
memory 4. During WS to CM compositing, a maximum of one pixel is read and written in every pixel clock period. 
55 Reads are requested from the workscreen memory 30 in the first half of the pixel clock period and writes are 
performed on the compositing memory 4 in the second half of the pixel clock period. If the workscreen read 
has not been acknowledged by the end of the pixel dock period, the pipeline will be stalled. The read request 
will remain pending, and the write to the compositing memory 4 will be repeated at the same address. Separate 
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addresses are ma.ntamed for the workscreen memory 30 addresses and the compositing memory 4 addresses 
Due to tailing constraints ,n the workscreen memory 30. there can be a delay between 7s^no a r«H 

^nl?; a / 6 9 H th TH data /, tthataddreSS - A9ain ' handShake S *> nals fr ° m »» Pan/zoom engine g'indic^e 
when the data ,s ready. Th,s delay ,s compensated for by stalling the graphics engine pipeline 10 until th read 
data .s received. The composing memory 4 has a one clock delay when writing the pixel date bu h? Z 
not affect the initial operation of the graphics engine 10.' ' ,h,s does 

WS to CM compositing can be used in conjunction with a reduction ratio to increase the size of the im aoP 
as ,t is cop,ed mto the compositing memory 4. In this mode, the pan/zoom engJnTiwt^ses a Z? 
address during the first pixel in a run. but only increments this address after every "N" X ead reoTetts fi^m 
the graph.cs engine 10. This has the effect of expanding each pixel, but is transparent to he oSn oft 
graph.cs eng.ne 1 0. Note that on.y the first pixel address in a run is used by the pan/zoom Engine 9 I The o™h ~ 

IZZ !° 9enerates addresses for subsequent pixels - but these ■^*"»tSS B SS2:ssS- 

enginTa 0 ^ C ° mP ° SiHn9 mUSt n ° l be used in ""Junction with an expansion zoom ratio in the pan/zoom 

rn™H 9U Th i0n COmma " ds are used to sel U P intema ' and eternal hardware prior to executing composite 
to beTn^ed C ° mman<1S ^ * C ° mp ° SiBn9 C °~ ds to a »°* long instruct Ttnaam^ 

a ,* Loa "™ ratio command passes a parameter word following the instruction to the pan/zoom enoine 
9 vn the workscreen address bus. The expansion factor appears on bits 3 to 0 of the parameteTan^Te reX: 
tion factor appears on bits 7 to 4. Timing for the transfer is the same as for a WS write command lan* I T.Z 

~rme? e r n,enTO 

ora nt LOad ^n^T 6 (GE> Sa " Confi S"" a «°" command allows various modes to be set within the 
graph cs eng.ne 10. .deal.y. these modes are contained within each graphics instruction. A limited numbe of 

ameter °" ^ Tab,e 3 shows »• *"* « ^SS^. 

red" B 'L 6 ,tl h ,T 9h 4 ° f ' h ! Parame,er followin 9 the ,oad GE configuration instruction configure the "matte to 
red , matte to green", and "matte to blue" modes respectively Bits 3 to 0 of th* n^n^jT, T . . 

write, to mdivdual planes in the workscreen memory 30. Compositing memory 4 writes are not affected bv h~» 

Bi fs Su^O^ Wri ' e , ena , b,eS " 9raPhiCS inStajCti ° nS Ca " a,S ° d iSable ""'^ b "< n°< ^ n~ b plane s 
n L STS, * 6 k S , , ° a "° W WhiCh h n0m,a " y written t0 ,he matte Pl a " a . <° be written to me red 
°ll 1 Z SCreen COmposi,in 9 °P era «°- <° ^ sped-up in certain circumstances When 
» b l T P « T? US6d ' thS 200m rati ° iS 1 :1 ' tnese ,hree bits can be set The SBR bit enables the 
bH in the Z ™ ^ iS m0de - SCrSen rSadS are " 0t re£ " uested from *e workscreen ^ the app'oprilte 

conlml P r, . h3Ve diSab ' ed "" subse ^ent screen write. If the read is "skipped" thfpfce w H 
consume only one p.xel clock period, instead of the three or more required by the current implementeTon of 

Ire r u b S Z ,°arr 9me ^ TeXttyPiCaHy °° ntainS 3 ,ar ^^-Seof"wh\es P ac y e»,so,he P o^L e :^ 

The SBW bit enables a "skip bit matte writes" mode. This mode disables screen write requests when the 

t~he e sk p' n b :atle m r a L maSk > °" ^ *° «™ " ane ^-2^ In order 

teoom enntn J Q l "° d " '° fUnC,i ° n with *• current ™plementatton of the pan- 

^ e " 9,ne 9 - " IS necessary to assert a line of the compositing memory 4 control bus 21 in every read or 

TL7« Z 6 r' 200 " 1 en9i " e ' S aUt ° matic address calculations. The SER and SEW bL assert 

viduallt of co| e e 9 c .^, C °Th rnand a,IOWSmost of lne intemal re 9^'ers of the graphics engine 10 to be loaded indi- 
vidually or collect.vely The configuration of the load register instruction is shown in Table 4 This command is 

InTn ^ T"?'?^ " eXeCUte microc ° d «" Instruction for testing purposes. Bite 27 mrauTo of the 

^ « MorTrnan 9 ' 5 '" " ^ !* '° aded " ^ >™™*<™^ Ins^ctionTthen ,o del 
? « rttfinfn • ? u ° ne re9 ' Ster can be loaded with the parameter For example registers 0 to 

^^tS^ atU : e SamS timS b6CaUSe 6aCh re9iS,er USSS *«erentbit;3 h?pa^ 
issued m,crocode reg.ster would also be loaded, before the "execute microcode" instruction 

c-nrlZ^ 9 ' h a T TeXtUre ° ffSet fe9iSters (to be later Ascribed) cannot be loaded using the load register 
T h rr Bh > re9 ' S, T Ca " ed " a " " ,n lengtn " and " alt texture offser registers, however c^n be olded 

The graph.cs p.pel.ne uses these registers instead of the usua. registers whence "execute microcode" 



EP 0 465 250 A2 

tion is executed. Table 5 provides a description of various registers provided within the graphics engine 10. 

Synchronisation commands are used to maintain the graphics engine 10 synchronised to internal and exter- 
nal hardware events. There are two synchronisation commands, namely no operation (NOP) and WAIT. 

The NOP instruction has no effect upon the graphics pipeline other than to consume one word and one 
5 clock cycle. It is intended to be used at the end of an instruction stream so that the length of the stream is a 
multiple of four 32-bit words. This ensures that the DMA controller, provided on bus 14 in Fig. 2, and which 
transfers data in four word bursts, will always send the last command. 

The WAIT instruction prevents the graphics engine 10 from executing new instructions unt3 one of three 
events occur. Those events are: a hardware restart signal is received, a software restart is received, or the 
10 graphics pipeline becomes empty. Options to the instructions select which of the events will restart instruction 
execution. Pixels from the previous instruction which are still in the graphics pipeline, are not effected by this 
instruction. 

The hardware restart option allows the graphics engine 10 instruction execution to be synchronised to 
external "real-time" events. This is required when performing a "test scan" where the graphics engine 1 0 is used 

15 to copy an image from the compositing memory 4 to the workscreen 3, while the scanner 6 is writing to the 
page memory 4 simultaneously. The graphics engine 10 and the scanner 6 operate on opposite halves of the 
page memory 4, and swap over every eight lines. The instruction stream presented to the graphics engine 10 
is separated into multiple strips of eight lines. A single WAIT instruction is inserted between the compositing 
commands for each strip. This forces the graphics engine 10 to wait for the scanner to "catch-up" before pro- 

20 ceeding to the next step. 

A software restart option allows the computer 1 to be synchronised to the graphics engine 10 instruction 
execution. If the computer 1 needs to perform certain tasks between groups of graphics instructions, a WAIT 
instruction can be inserted between the groups. When the graphics engine 10 reaches the WAIT, an interrupt 
is optionally generated, and the execution of the new instructions is suspended until the computer 1 writes to 

25 a CONTROL register with the RESTART bit set, as will be discussed later. 

The pipeline empty restart option allows the graphics engine 10 to wait-until the previous instruction is fully 
completed before starting a new instruction. This instruction is generally not required in normal operation. It is 
only required if a design error allowed neighbouring i instructions to effect each other. Table 6 shows the bit 
relationship of the WAIT instruction. 

30 In addition to registers provided for the DMA bus 14 in Fig. 2, registers are also available for the computer 

1 to interface with the graphics engine 10. Table 7 lists internal registers of the graphics engine 10 showing 
the relative address, size and whether the respective address is readable or writable. The COMMAND register 
provides an alternate mechanism for writing graphics commands to the graphics engine 1 0. Writing to the COM- 
MAND register places the input data on the top of a FIFO 150 within the RP interface 100, and is effectively 

35 the same as writing via the DMA bus 1 4. The status of the FIFO 1 50 should be read from the STATUS register 
before writing to the-COMMAND register, to ensure that the FIFO 150 is not full. Writing graphics commands 
via the COMMAND register is relatively slow, as it is not expected to be used in normal operation. It does, how- 
ever, provide a means of "single stepping" the graphics engine 10 for debugging and self test purposes. The 
FIFO 150 can also be written one word at a time. The control unit 300, however does not read the FIFO 150 

40 until it contains at least four words. Table 8 shows the arrangement of the command register. 

The CONTROL register is used for setting interrupt , masks, clearing interrupts, and enabling various test 
modes, and includes a bit configuration as shown in Table 9. 

The HIE bit is the hardware interrupt enable. If set to a 1, the computer 1 will receive an interrupt when the 
graphics engine 10 receives a WAIT command when the "hardware restart" option is enabled. This interrupt 

45 is cleared by writing a 1 to the CHI bit (even rf the graphics engine 10 is stopped). The setting of the HIE bit 
can be read from the STATUS register. 

The EIE bit is the "ERROR" interrupt enable. If set to a 1, the computer 1 will receive an interrupt when 
the graphics engine 10 detects an error condition. This interrupt is cleared by writing a 1 to the CEI bit, until a 
new error is detected. An error is detected rf a hardware restart or a software restart is received when the 

so graphics engine 10 is not in the STOPPED state. In the case of a hardware restart, this indicates a loss of 
synchronisation with the external real time event The current setting of the EIE bit can be read from the STATUS 
register. 

The RST bit is used to restart the graphics pipeline. When this bit changes from a 0 to a 1, the graphics 
pipeline is restarted. 

55 The SWR bit is used to generate a software reset When set to a 1. the entire graphics engine 10 is reset 

to the power on reset condition. The SWR bit does not need to be cleared by a programmer, as the CONTROL 
register is also reset. 

The SEE bit is the "stop on error* enable. If set to a 1, the graphics engine 10 will enter the STOPPED 
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condition if an error is detected. 

Six bits designated TMn are used to enable test modes in blocks within the graphics engine 10 TMO is 
used to put the run length counter 220 into a test mode as will be discussed later. TM1 is used to put paoe and 
screen address counters, within the address generator 800 into a test mode, as will also be discussed later 

The STATUS reg.ster is used to monitor the interrupt flags and the FIFO 150 status, to read the version 
number, and to read the status of the various test modules. Table 9 shows the configuration of the staJs reg" 

c , c ^V^' . SIE E ' E bitS refIeCt me Stat6 ° f ,he interrupt masks set in the CONTROL register The HIF 
sn^l^Z'^Z fla9S - ' thBSe " a9S " ^ enablebit is set! 

»n=I h8 TK F k-V S S f Whe A the 9raPhiCS enQine 1 ° receives a WAIT co "«nand with the "software restart" option 
enable. This bit is cleared by writing a 1 to the CSI bit in the CONTROL register 

in thlcONTROL registr *" C ° nditi ° n " deteCted - ™ S h C ' e3red WriBna 3 1 '° the CEI bit 

nl rp^' ' S T T B "^ e RFO 150 iS fU "' indiCat '" ng ,hat the COMMAND register should not be written 
The GR bl t ,s set when the graphics engine 10 is running. Running is defined as "not waiting and the FIFO 
150 is not empty", (i.e. when the graphics engine 10 is executing an instruction) 

F.Fn^n D "irlf 3 ° MA controller - with lhe Rp conWIar 120. detects an overrun condition in the 

FIFO iso. The TSn bits reflect the leve. of a number of internal modes which are used during testing by he 
dev.ce manufacturer durmg manufacture of the graphics engine 10 using LSI technology. The Vn bits contain 
the version number of the integrated circuit (LSI) chip. contain 
The graphics engine 1 0 inputs and outputs can be classified into four categories- 

1. Interface signals with the computer 1, and in the preferred embodiment, the computer is an i960 pro- 
cessor operating at 33MHz; H on laou prc- 

2. Page memory 4 interface signals; 

3. pan/zoom interface signals; and 

4. other interconnect signals. 

Table 10 provides a list of each of these signals. 

The computer 1 (i960) can access the graphics engine 10 in three ways, either by DMA writes register 

ZtZu*AT h T s - T H he p DMA interface 14 is used to transfer 9raphics commands fr ° m *° £ o 

the computer 1 at h,gh speed. Register wntes are used to configure the graphics engine 1 0. and register reads 
debug" mod°eT Sta ' US °' en9i " e ' °" re3dS a " d ™ a ' so "sed in tesTand 

dM S n S P h Sed DMA acce *f es are reauired in ° rd er to transfer large images to the graphics eng ine 1 0 without 
degrading the operahon of the graphics system, seen in Fig. 1. The graphics engine 10 has an internal FIFO 
1 50 which accepts four consecutive 32-bit words from the DMA controller of the computer 1 in a single DMA 
Z nrV^ " knOWn 33 3 " qUad W ° rd bUfSt flyb V" DMA *»"*r- A DMA request is issued by 

h! fT'T T 8 "? ! n6Ver ^ F ' FO 1 50 h3S r °° m f ° r at least four 32 " bit words - Tne DMA droller o^ 
the computer 1 subsequently acknowledges the request and writes four 32-bit words of data in a single burst 

rT M S of 2 1 TTt the ' ranSfer fe ^ermined by the access time of the memory (usual.y dynamic 
RAM) of the computer 1. The tone between accesses is determined by the available bandwidth of the bus inter- 
connecting the computer 1 and graphics engine 10). If no other DMA channels are operating, the graphics 

IZZ 1 ' t S h 9Uara . n f ed ? l6aSt 5 ° % aC ° eSS t0 ,hiS bUS " AsSumi "9 ™ st accesses * the computer 1 are 
JZX t rl tI 6 ^ T '° * he 9raPhiCS 6n9ine 1 0 is a PP-°x^tely 432-bit words every 26 processo 

wnfn SU^O ST mi " i0n " C0 ' OUr PiX6lS M ° f 20 me9abyt6S ^ ^° nd 31 33MHZ 

and T^^^X^:^! ^ dat3 ^ 3ddreSS Mne - 3 

band^.hTf'Inn 6 betWe ? h« 9raphiCS e " 9ine 10 and the com P° si « n ° Paae memory 4 preferably has a 
nerted m ° f V 1 °° ™9* b ** s Per second. One 32-bit pixel is read and written in every pixel clock 

The da L 1 ^h ^ H Perf ?- mled in ' he firSt h3lf ° f the Pixel d0Ck periods - and the Second half. 

^2 bit^care mf.l T*™ **" ^ ^ Pe "' 0d - Si " Ce the width ° f lhe da » a « ^gh 

loi™™ be taken to ensure that there is no contention between the graphics engine 10 and the conv 

posibng memory q when the bus 19 is changed from input to output, and output to input 

ronX^Th'"" 9 T"" 5 ^ 4 US6S St3tiC Cnips with an access lime ° f 25 *• 'n order to relax timing 
constraints on the gra P h,cs engine 10. a one clock delay is inserted between issuing a read address, and real 
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ing the data at that address. This delay is compensated for by adjusting the position within the graphics pipeline 
of the engine 10 at which the data is accepted. 

Furthermore, in order to relax timing constraints on the graphics engine 10, a one clock delay is inserted 
between issuing a write, and when the write is actually performed. This delay is transparent to the operation 
of the graphics engine 10. 

The interface between the graphics engine 10 and the workscreen 3 uses the same address and data bus 
as used to read and write to the compositing memory 4. The workscreen memory 30 is shared between an S 
Bus device, display refresh, and reads and writes from the graphics engine 10 via the pan/zoom engine 9. As 
a result, there is a delay between issuing a read request, and reading the data at that address. Handshake 
signals from the pan/zoom engine 9 indicate when the data is ready. This delay is accounted for by stalling the 
pipeline of the graphics engine 10 until the read data is received. The workscreen memory 30 can also have 
a delay when writing pixel data. Typically, the first pixel written will not be delayed, as the write request is posted, 
and performed in subsequent clock periods. 

The preferred hardware implementation of the graphics engine 10 can now be described having introduced 
various control functions that the graphics engine 10 provides. Not all hardware components are described in 
detail as some perform functions that are readily identifiable from their name and hence are readily understood 
by those skilled in the art. 

Unless otherwise stated, all latches and flipfiops to be referred to use the main pixel clock signal PXCLK 
(operating at 13.35 megahertz). Enable signals do not clock these devices directly, or gate the clock, but simply 
select new data via a front end multiplexer which would otherwise recirculate old data. The various modules 
of Fig. 2 can now be described. 

The render processor interface 100 of Fig. 3 is responsible for controlling and monitoring the status of the 
graphics pipeline of the engine 1 0 from the computer 1 which is preferably an i960 processor. All control signals 
between the graphics engine 10 and the computer 1 are synchronous to the computer dock HCLK. The RP 
interface 100 resynchronises the signals to the pixel clock (PXCLK) which is used by the remainder of the 
graphics engine 10. The RP interface 100 is divided intr a control block 120 and a first- in-first-out shift register 
(FIFO) 150. 

The RP controller 120 is not shown in detail but generally includes an address decoder, a control register, 
a status register, an interrupt controller and a DMA controller. The address decoder generates register select 
signals for the control register, the status register and the command register. The control register, when selected 
by the address decoder is loaded with data from the computer 1 which is used to configure the interrupt, 
synchronisation, and test modes as earlier defined. The control register contains resynchronisation flip-flops 
to synchronise to the pixel clock PXCLK. The status register allows the computer 1 to monitor interrupt flags, 
the status of the FIFO 150, and other internal conditions as defined earlier. The status register also contains 
circuitry to regenerate synchronised reset signals and status outputs. The interrupt controller combines interrupt 
flags from the status register where the interrupt enables from the control register to produce the output INT of 
i the interface 100. The DMA controller generates a DMA request when ROOM_44, seen in Fig. 3, indicates 
that the FIFO 1 50 has room for four more words from the computer 1 . When the computer 1 grants a DMA cycle 
via the DACK signal, a new word is written into the FIFO 150 in each processor clock period where NEXT is 
asserted. If a DMA acknowledge is received when a DMA cycle has not been requested, an error condition is 
generated. A WRITEFF signal is asserted for each valid DMA word. 

The FIFO 1 50 as seen in Fig. 4 uses an eight word 32 bit dual port RAM 170 to interface the computer 1 
with the pixel pipeline within the engine 10. A write address generator 155, synchronised to the processer clock 
PLCK interfaces the computer 1 to the RAM 170. A read address generator 160. synchronised to the pixel clock 
PXCLK interfaces the RAM 170 to the pixel pipeline. Handshake signals between the two address generators 
are synchronised in each of their respective receiving circuits. 

The write address generator 155 operates synchronously with the i processer clock PCLK to produce an 
address and a write pulse to the RAM 170. The generator 155 also generates a signal to indicate when there 
is room for four more words, and two signals to indicate when the two halves of the FIFO 150 contain valid 
data. The FFMT output provides an indication to the computer 1 via the status register of the RP Controller 
120 that the COMMAND register should not be written. 

The generator 155 is implemented with a three-bit counter which is reset to 0 when the RP_RESET signal 
is asserted. The counter increments when the WR]TE_FF signal is asserted. 

The read address generator 160 operates synchronously with the pixel clock PXCLK to produce an address 
to the RAM 170. The generator 160 provides a signal to indicate when the FIFO 150 is empty and two signals 
to indicate when the two halves of the FIFO 150 have been read. The generator 160 is implemented with a 
four-bit state machine which is initialized when the PX_RESET signal is asserted. 

Fig. 5 shows a state transition diagram which illustrates how the address and the "empty* signal (MT) are 
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generated within the read address generator 160. 

The control unit 300, seen in Fig. 6, interfaces the render processer FIFO 150 to the orachies nin»ii„» 
unit 300 consists of instruction and microcode registers 305 and 310. an instnicZtnterpreter 320 and ^ 
instruct™ decoder 350. The instruction interpreter 320 parses incoming instructions and pa amete«' Zr, 
the registers 305 and 31 0 in accordance with the command received. The In^^ST^S^fS 
instruction reg.ster 310 to produce a status word which is passed to the graphics pipeline Tan "EX Urnnc* 

Th isTor-' 5 ? ,nStnJCti ° n deCOd6 iS byPaSS6d b * the data * - de register 305 

This code is used for test and debug purposes. 'egisier JU5. 

The construction and operation of the instruction interpreter 320 and instruction decoder 350 will h» = 
rentto those skilled in the deve.opmen, of apphcations specific ^ocon^ZZT^n Z deve'oT 
ment of each new m.crocontroller, a new or modified instruction set must be developed whfch when readfna 
data, must involve the interpretation and decoding of various commands 9 

The run controller 200, seen in Fig. 7, interconnects the RP interface 100 with the data oath inrt..rt, nn in,= 
Poaor S 400 and compositors 500. The run controller 200 performs two tasks Zugh^ 
polat,on ramp and to md.cate when the last pixel in a run is being composited. To perfom ™ITm"s * 
is required that the run controller 200 be loaded with the run length, a start pulse, a valiZdicator and 
jndicator. The run length is loaded from the FIFO 150 database when ins ducted bylhelsSn .nterprete 

ftan tTnJr T ? Start a " d Va "' d Signa,S a,S0 come from the ins tructi°n interpreTef azo S£ 

stall signal comes from the write controller 900. As seen in fiq 7 the run contrnii»r inn JwTT 

register 210, a run length counter 220. and a ramp generator 250 «""*»200 '"*<udes a run length 

216 T Trt7^h n , 9 ^ re9 f er 2 l°- ShOW " m ° re de ' ail F ' 9 - 8 ' indudes latehes 212 and 214 and a multiplexer 
«n?h I C ° 6 ™ le " 9th ° f thS nSXt 9raphiCS c °"""^nd. The current graphics coTmoE^fn 

rs^s^ 

^ T ^ To'^ 9 " 1 , C ° Un ' er 2201 See " in deta " in Fi9 ' 9 ' inclljdes a multiplexer 222, a decrements counter 

oositerfa rf 2 rf' ' f^f " ^ * ^ Jhe ™ le " 9,h c ° unter 220 the number of pix'conl 

pos.ted and md.cates when the last pixel is in progress. The counter 220 is initialized with one less than"^ 
value ,n the mn length register 210 when the START signal is asserted. The counter 220 decrements ?n ^M 
cycles, includmg the start cycle, unless STALL is asserted or VALID is neaated in whirh r«»^ > 
retains the current va.ue. When the counter reaches 0. the LAST signal is Asserted The mtinTum 6 TT 

real 1 P H 7 ^ ™ 9eneraleS a lineari * increasina from 0 to 255 overThe 

real t,me durafon of the graphics instruction being executed. The generator 250 is initialized withTe vatui in 

t^AuC^ 

b„ t l h ^T 9ener ? l ^ 25 ° comprises an in P ut latch 252 which supplies a multiplexer switch 254 selected 
indiv'd J 7 T h' ^ mul,iplexer 254 °"tP"ts the 13 bit data word, representing the run lenmh to fotfr 
ROM 260 whSrr, I™*™ S6lective bite ° f word. The eight leas, significant bits are suppl'ied ° a 
ROM 260 which comprises a lookup table outputting a value corresponding to 255/(RL-1) All 13 bi s of The 

r ^fuT^^ - - 

The multiplexer 282 ,s switched when the most signficant bit output from me ac^mu.atorD^X a^Ts asser' 
*L The generator 250 afco includes a summer 290. further multiplexers 286 and 288. a flipflop 294 an fatcn" 

ted back to the summer 290, with the eight most significant bits outputting to the multiplexer 288 As indir^ 

ttnSZZV" bit> bit 15 ' feeds back to switch ,he » er 282 -^£&T£Z 

unZT* -f nd B f h °^ he ^ WayS Which the rarn P out P ut of generator 250 is formed Note that an 
whfn fht V ,k 250 d06S n0t necessari, y imp 'y the ramp output value is changed (as in the ^Lse 
* ' en9lh IS 9reater than 2 55). The generator 250 operates in two modes. Firstiy a >mp • mode w ^en 

l 6 ~"' en,h ' S 'Tk ha " ° r eqUa ' t0 255 ' and a " step " ™ de whe " the run length is greatTLn 255 The 
Z™U,T7 y a r Xami " ati0n 0f the r,ve most si9n ^"t "its of the run length. In me ump mode The 

When in the jump mode, the output value from the ROM 260 is added.to the accumulator 292 which is 
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initialized to zero by the START signal. The most significant bits of the accumulator form the ramp output as 
can be seen from Fig. 10. 

When in the step mode, the generator 250 applies an algorithm similar to Bresenham's line drawing 
algorithm, which is known to those in the computer graphic arts. The accumulator 292 is initialized with the con- 
5 stant (511 - RL), and the ramp counter is initialized to zero. In subsequent clocks, one of two constants are 
added to the accumulator:- 51 0 if the accumulator is negative, or, (51 2 - 2RL) if the accumulator is positive. 
The ramp counter (flip flop 294) is incremented on every clock where the accumulator 292 is positive. Note 
that thestep mode requires the multiplexer 254 to bypass the latch 252 during the START cycle. This allows 
the new value in the CONSTI module 265 to be loaded into the accumulator 292 in the START cycle. The mul- 
10 tiplexers 280 and 282 are shown in Fig. 10 for clarity only. A simpler implementation for the multiplexer 280 is 
a string of AND gates. Similarly, the multiplexer 282 can be implemented as a string of OR gate for the nine 
least significant bits, and a string of AND gates for the most significant bits. 

The colour component data path of the graphic engine 1 0 consists of three eight-bit data paths. Each eight- 
bit path contains an interpolator 400, which also performs colour correction, and a compositor 500. The red, 
15 green and blue interpolators are identical. The three compositors, however, are slightly different The green 
compositor is used by the matte channel in some instructions and the resulting differences in the green channel 
are described later in this specification. The colour interpolators 400 comprise three modules: a synchronisation 
module 41 0, an interpolation phase A 420, and an interpolation phase B 450. The synchronisation module 410 
spans stages 0 and 1 of the graphics pipeline and holds operands for the current instruction, and instruction 
20 operands for the next instruction. The interpolation phases A and B (420 and 450) perform blends and colour 
correction. The two phases are required in order to maintain one pixel per clock cycle. Phase A (420) is in stage 
2 of the pipeline, and phase B (450) is in stage 3. 

The synchronisation module 410 is not shown in detail but however comprises latches and flipflops which 
provide the start colour, the end colour and the pixel value. 
25 The interpolation phase A 420 is shown in Fig. 13 and performs two types of calculations, either blends, 

or colour corrections. Those skilled in the art will appreciate that colour runs are simply a degenerate case of 
a blend when the start and end colours are the same. The phase A interpolator 420 performs calculations for 
the first part of both types, namely 

for blends, CJVIULT = (END -START)* RAMP /256 
30 for colour correction, C_MULT = (END*PIXEL/256). 

The interpolator phase A420 comprises a summer 4Z2 which subtracts the START value from the END 
value. The multiplexers 424 and 426 change the inputs to the multiplier 430 respectively. For blend commands, 
the summer 422 has a nine-bit two's complement output which is fed to the multiplier 430. For colour correction 
commands, the END value (which also corresponds to the CONTRAST) is fed to the multiplier 430 when the 
35 sign bit is set positive. In blend commands, the second inputof the multiplier 430 comes from the ramp generator 
250 of Fig. 1 0 and has the effect of controlling the mixing proportion. In colour correction commands, the second 
input of the multiplier 430 comes from the commands pixel string. A new pixel to be colour corrected is loaded 
every non-stalled clock period. Only the top nine bits and the sign of the multiplier output are required. The 
output is a 10-bit two's complement number. Rounding is not performed as a one least significant bit error is 
40 considered acceptable. 

The START_D output contains a delayed version of the START value for the next phase of the interpolator. 
In colour correction mode, the START value is equivalent to the BRIGHTNESS. 

Phase B 450 of the interpolator 400 performs the second part of both of the blends, and colour correction 
calculations, namely 

45 for blends, D_OUT = C_MULT + START.D = ((END-START)*RAMP/256) + START 

for colour correction, D_OUT = (C_MULT*2) + START.D = ((EN D_START)* RAMP/1 28) + START 
It should be noted that rounding errors introduced by the multiplier 430 of phase A 420 lead to an error of 
one in the final value of the- blend. That is 

for END-START>0, D_OUT(FINAL) = END-1 
50 for END-START>0, D_OUT(FINAL) = END + 1. 

The phase B circuit 450 comprises two multiplexers 452 and 454 both supplying a summer 458. The sum- 
mer 458 supplies a limiter 460 which, in turn, supplies the output via a latch 462. The multiplexer 452 changes 
the inputs to the adder 458 to perform the multiplication by 2 for the colour correction function. The most sig- 
nificant bits pass straight through to the adder 458. For blend commands, this has the effect of signextending 
55 bit 8 by one position. 

The multiplexer 454 adds an extra two inputs to the adder 458 to perform a sign-extension for the colour 
correction function, which uses a 8-bit signed format for the BRIGHTNESS. For blend commands, these two 
bits are set to 0, as the START value is unsigned. The adder 458 sums the two 10-bit signed numbers output 
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15 



20 



from the multiplexes 452 and 454 to produce a 1 0-bit signed result, and a positive overflow indicator (POV) A 
negat,ve overflow is not possible in either blend or colour correction modes. In blend mode, the start value is 
always positive, and in colour correction mode, the MULT value is always positive 

The adder 458 output is passed to the limiter 460 which is used to saturate values greater than 255 to 255 

our sourirP r m 1 f^T be dbab ' ed -" y a " EFFECTS si 9 nal which comes fr °-» *• COU 

OUR SOURCE field ,n the command. The limiter 460 has no effect in run or blend commands as the adder 

458 output is always in the range 0 ... 255. The output from the limiter 460 is latched (462) in every non-stalled 
clock period, in readiness for the next pipeline stage. 

,„ tu r^""^! 60 iS " 0t Sh ° Wn deta " 35 " S confi 9 uration can be readily derived from the above description 
10 The limiter 460 has an input bus range from -1 28 to +638. 

™ F J 9 ,'„ 15 Sh T 3 ~ lour A c °";P ositer 500 whic h '"eludes four modules comprising a synchronisation module . 
520 a compositing phase A 540, a compositing phase B 560, and a matte multiplexer 580. The synchronisation 
module 520 ,s .n stage 4 of the graphics pipeline. It latches the RGBM data from the external compositing or 

h «T en J nem °T, <28 ° f 30) 3nd s y nchronises « with P'xel Pipeline. Compositing phases A and B 540 
and 560 perform a blend between the original value in the external memory, and the new value from the cor- 
respondmg interpolator 400. The proportion between original and new is controlled by the BLEND bus which 
comes from the matte combiner 700. Two phases are again required in order to maintain one pixel per clock 
cycle phase. Phase A540 is in stage 5 of the graphics pipeline and phase B 560 is in stage 6. The matte mul- 
tiplexer 580 allows the data from the matte channel to be output on one or more of the colour planes TrTis 
performed in stage 7 of the pipeline. The G OUT bus is shown shaded as it only exists in the green channel 
for use with the matte combiner 700. siwm uwinei 

The synchronisation module 520 is not shown in detail as it merely latches data from external memories 
to provide that data synchronously with the phase A unit 540. The STALL signal is delayed in this module by 

« Z™?™ to 1 T P T e f ° r ' he ° ne C ' OCk delay when readi "9 from tne compositing or workscreen mem- 

25 ory 28 or 30 as earlier noted. 

Phase A 540 of the compositor 500, seen in Fig. 16, performs the following calculation: 

CJvlULT = (C_ORIG - C_NEW)* BLEND/2 56 
Phase A 540 includes a summer 542, a multiplier 546 and output latches 548 and 550. All commands use 
™ T t u 8 " bit unsigned - va,ues on each of tne NEW, ORIG, and BLEND inputs. The output from 

so the adder 542, wh.ch, having an inverted input functions as a subtracter, is a 9-bit two's complement number 
which is fed to the multiplier 546. The second input to the multiplier 546 comes from the matte combiner 700 
and has the effect of controlling the mixing proportion. A value of 0 represents a fully opaque compositing. Only 
the top 8 bits and the sign of the multiplier 456 output are required. The output is a 9-bit two's complement 
mpw n R ~ n f infl ' a 9 a ' n ™ l Performed as a one least significant bit error is considered acceptable. The 
35 NEW D output contams . a delayed version of the NEW value for the next phase (560) of the compositor 500 
Phase B 560 of the compositor 500 is shown in Fig. 1 7 and performs the following calculation- 

DJvlULT = CJvlULT + C_NEW_D = ((O RIG-NEW)* BLEND/256) + C_NEW 
Note that rounding errors introduced by the multiplier 546,in phase A 540 lead to an error of at least one 
for non-opaque compositing. For example: 
to when ORIG-NEW>0, 

D_OUT (fully transparent) = ORIG-1 , 
when ORIG-NEW<0. 
D_OUT (fully transparent) = ORIG+1 . 
Fully opaque compositing will always produce the exact NEW value That is- 
45 D_OUT(fully opaque)=NEW. 

A „fh h ?h Se ,f h 5 f ° inClUd ! S '." add6r 562 WhiCh SUmS *"> 9 " bit Si9ned number from ,he """iP"^ S46 in phase 

t»ln? ^« S ' 9 ?! Va ' Ue fr ° m ,he ° RIG d6lay flip - fl0p 550 also in P hase A 54 °- Th * result is always in 
the range 0 to 255 as the compositor 500 can only produce numbers between the ORIG and NEW values. The 

so sta^re if P e d ne er '** *** n ° n - Stal,ed dock P"** in readiness «* output 

700 ST If* P3 !' See " Fi9 ' 2 com P rises a transparency interpolator 600 and a matte combiner 

Leen a S^ART fndFMnT 5 ma '" ^"T" 3 ' '° 9<!nerate 3 «*-P««V °y interpolating be- 
tween a START and END transparency, and to combine the transparency blend with the matte plane. 

m „dL^n nSParenCy , , nt ( erp0,at0r 600 iS Sh0W " in Fi9 - 18 havin 9 three modules includin 9 a synchronisation 
Tnl lL n ' nt !T° ,T" PhaSe A 640> and a " inter P° la «°" P^se B 670. The synchronisation module 610 
spans stages 0 and 1 of the pipeline. It holds operands for the current instruction, and latches operands for the 

TZf^T 'T haSeS ( A640 and B 670 perf °™ transparency b.ends and texturing. Again two phases are 
required in order to maintain one pixel per clock cycle. Phase A is in stage 2 of the pipeline and phase B is in 
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stage 3. 

Fig. 19 shows the various components within the synchronisation module 610. Latches 612 and 614 hold 
the START and END transparency parameters for the next instruction. This corresponds to stage 0 of the 
pipeline. Rip flops-620 and 622 load these values when the START signal indicates the first pixel of a new 
instruction is in progress. A third latch-616 holds the texture start parameter for the next instruction. This is a 
2-bit parameter which determines the byte offset into the initial texture word for texture run and blend com- 
mands. It is loaded into a third flip-flop 624 via a next offset module 618 when the START signal indicates the 
first pixel of an new instruction is in progress. A fourth flip-flop 626 holds the texture for up to four pixels in texture 
run and blend commands. The flip-flop 626 is loaded from the data bus D_IN in every fourth non-stalled clock 
period. A fifth flip-flop 628 holds the transparency for each pixel value in pixel string commands, and is loaded 
from the data bus in every non-stalled clock period. 

The next offset module 618 is used to control the texture byte offset, and the texture word flip-flop 626. For 
texture run and blend commands, the texture byte offset is Initialised when the START signal is asserted, and 
counts by one in other non-stalled clock periods. When the offset equals 3. the LASTT signal is asserted.' The 
texture word flip-flop 626 enable signal is also asserted at this time as well as during the start cycle 

Phase A 640 of the interpolator 600 is shown in Fig. 20 and performs the following calculation: 

T_MULT = (T_EN D-T_START) * RA M P/256 . 

Phase A 640 includes a summer 642 configured to subtract two input data bytes. The output of the summer 
642 is a 9-bit two's complement output which is fed to a multiplier 644. The second input of the multiplier 644 
comes from the ramp generator 150 of Fig. 10 and has the effect of controlling the mixing proportion between 
the start and end values. Only the top eight bits and the sign of the multiplier 644 output are required. The output 
is a 9-bit two's complement number. Again, rounding is not performed as one least significant bit error is con- 
sidered acceptable. Phase A 640 also has a T_START_D output which contains a delayed version of the 
T— START value provided by a latch 652 for the next phase of the interpolator 600. A multiplexer 646 uses the 
T_OFF signal from the synchroniser 61 0 to select one of the four texture bytes in the T_PIXELS input In texture 
run and blend commands, a different byte is selected in each pixel clock PXCLK period. In pixel string com- 
mands, the transparency is selected from the T_PIXELS input via a second multiplexer 648. The texture is 
latched (654) to maintain synchronisation with pipeline stage 2. 

Phase B 670 of the interpolator 600 is shown in fig. 21 and performs the second part of blend calculations, 
namely: 

TR_OUT = T_MULT + T_START_D = ((T_END-T_ST ART)* RAMP/256 + T-START. 
Note that rounding errors introduced by the multiplier 644 in phase A 640 lead to an error of one in the final 
value of a blend. That is: 

for T_END-T_START>0,TR_OUT(FINAL) = T-END-1 
for T_END-T_START<O.TR_OUT(FINAL) = T_END + I 
In Fig. 21, an adder 672 sums the 9- bit signed 

value from the multiplier 644 in Fig. 20 with the 8-bit start value. The result is always a positive 8-bit value 
since it must be in the range T_START ... T_END. A multiplexer 676 selects the output of the adder 672 for 
run or blend commands, or the texture byte for other commands. In bitmap commands, and commands which 
do not have subsequent words, the opaque signal is asserted. This has the effect of forcing the texture to 25S 
(opaque). 

The output from the multiplexer 676 is latched (678) in every non-stalled clock period, in readiness for the 
next pipeline stage. The transparency value from the output of the "opaqueing" module is also latched (680) 
and passed separately to the matte combiner 700. The matte combiner 700 uses this value when the "matte 
source = subsequent" option is specified by the instruction set. 

The matte combiner 700 is shown in Fig. 22 having four modules including a synchronisation module 710, 
a matte calculator 730, a matte delay 720. and a green multiplexer 790. 

The synchronisation module 710 is in stage 4 of the pipeline. It latches the matte plane of the RGBM data 
from the external compositing 4 or workscreen 30 memory in the first half of the pixel clock period. Based on 
this value and the transparency from the transparency interpolator 600, the matte calculator 730 produces a 
blend value for the colour compositors 500, and a new matte value for the compositing memory 28. Matte cal- 
culations are performed in stage 4 of the pipeline. The delay 720 passes the matte value unchanged to the 
green multiplexer 790 to synchronise with the colour components which are calculated in stages 5 and 6. The 
green multiplexer 790 allows the data from the green channel (400, 500: Fig. 2) to be output on the matte plane 
of the compositing memory 4. This is performed in stage 7 of the pipeline. 

Fig. 23 shows the components within the matte synchronisation module 710. A latch 712 produces a 
delayed version of the STALL signal to compensate for the one clock delay in the compositing memory 28. The 
input data from the external memory is asynchronous to the main internal pixel clock PXCLK. It is, however, 
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synchronous to the CLK 2 signal which is used to derive the pixel clock via a NAND gate 714 so the chase is 
fixed. This data is latched (716) in the first half of the pipeline stage 4. 

The matte calculator 730 is seen in Fig. 25 and performs two calculations: 

BLEND = LIMIT (TR_IN-M_SYNCD) 
M_OUT = LIMIT (M_SYNCD-TR IN) 

where LIMIT (a-b) = 0, if a - b < 0 

Both the TRJIM and the M_SYNCD inputs are 8-bit unsigned values. The components in the top half of 
Fi£ 25 calculate the BLEND output which is used to control the mixing proportion in the colour compositors 
500. input to this block with a value of 0 represents fully transparent compositing. An output value of 0 however 
represents fully opaque compositing. The BLEND value is calculated by substracting, using an adder 734 the 
matte data .n the compositing (or workscreen) memory 28 from the transparency which has either a run blend 
or texture value. A limiter 738 saturates this value to 0 if the result is negative, as indicated by the carry bit from 
the adder 734. A multiplexer 742 can bypass this calculation and select the input transparency if the T MI- 
NUS M input « not asserted (the T_MINUS_M signal is decoded from the command word in the controfunit 
300.) To compensate for the different input and output sense of the transparency, the multiplexer 742 is normally 
inverted by an EXOR block 750, or is passed straight through to the output via a latch 752 when the T INV 
signal is asserted. - 

The components in the bottom half of Fig. 25 calculate the M_OUT output which is written into the matte 
plane. This value is calculated by subtracting, using an adder 736, the transparency from the matte data in the 
compositing (or workscreen) memory 4,30. A further limiter 740 saturates this value to 0 if the result is negative 
as indicated by a carry from the adder 736. A subsequent multiplexer 746 can bypass this calculation and select 
the input transparency if the M_MINUS_T input is not asserted. A further multiplexer 748 selects the SUBS T 
value if the matte source field of the instruction specifies "subsequent". Both the BLEND M_OUT VALUES are 
latched to synchronise to stage 4 of the pipeline; 

The matte delay 720, seen in Fig. 24, passes the matte value unchanged to the green multiplexer 790 to 
synchron.se with the components which are calculated in stage 5 and 6. A flip-flop 722 synchronises to stage 
5, and a second flip-flop-754 synchronises to stage 6. 

The address generator block 800 of Fig. 2 is responsible for providing read and write addresses for the 
compositing memory 4 and the workscreen memory 30. A read cycle and a write cycle are performed in each 
pixel clock period. Compositing memory 4 addresses are provided by a page - address generator within the 
address generator 800, and workscreen addresses are provided by a screen address generator also within the 
generator 800. The address generator 800 operates in a manner known to those skilled in the art in selecting 
appropnate addresses within the workscreen and compositing memories. Having selected the appropriate 
address, either the page or screen addresses are multiplexed as determined by read and write signals The 
write addresses are passed through a five stage delay to compensate for delays through the colour and trans- 
parency data paths. 

The write controller 900 of Fig. 2 performs three main tasks, namely: 

1. To implement additional functionality required for "bitmap mode" instructions; 

2. To provide appropriate pipeline delays of write control and other status information- and 

3. To interface to the compositing memory 28 and the pan/zoom engine 9 handshake'signals 

L.ke the address generator 800, the write controller 900 forms functions known to those skilled in the art 
and a detailed explanation is not required in the specification. Essentially, the write controller 900 includes three 
stages comprising a bit map shift, a status delay, and an I/O control unit. In bit map mode instructions, a single 
32-bit word is used to enable or disable writes for up to 32 consecutive pixels. The status delay module provides 
appropnate pipeline delays of write control and other status information. It also contains a block which is used 
to detect when the pipeline is empty. The 110 controller interfaces to the compositing memory 4 and the pan- 
/zoom engine 9 handshake signals. 

The foregoing description provides an architecture by which a graphics engine 10 can be produced to 
achieve colour desktop publishing functions. 

The preferred embodiment of the graphics engine is able to implement a wide range of commands so as 
wa o a «!SS a !? ' ma9e dat3 ' F ' Ve SUCh commands respectively known as CANVAS RUN, CANVAS BLEND CAN- 
VAS PIXELS, CANVAS TEXTURE RUN and CANVAS TEXTURE BLEND will now be described. ' 

The CANVAS RUN function composites a run of colours with the image, with compositing controlled by 
the transparency and the matte plane. This command simulates the interaction of paint with canvas or other 
textured materials, as textured dents in the background progressively "fill up" with "paint". The canvas run com- 
mand composites the run length number of pixels of a colour equal to the start colour, starting at the page 
address. The intensity of compositing is controlled by the difference between the transparency run and the 
matte plane of the image. The matte plane is changed to, be the difference between the matte value and the 
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run transparency. 

The operation is as follows for n=0 to Run_length-1 

RGB (image) [address+n] 
= ( (T (start) .Matte (image) [address+n]) *RGB (start) 
+ (256- <T (start) .Matte (image) [address -Hi] ) ) 
*RGB (image) faddress+n] ) /256 

Matte (image) [address+n] 
= Matte (image) [address+n] .T (start) 

where . represents a subtraction operation with underflow limiting. 

The canvas run command is used for compositing object based constant colour "brush strokes" onto a text- 
ured background, where it is desirable to simulate the cumulative interaction between paint and a textured back- 
ground. To achieve this effect, the page matte should contain an image which corresponds to the texture of 
the background. 

The CANVAS BLEND function composites a blend of colours with the image, with compositing controlled 
by the blend transparency and the matte plane. This command simulates the interaction of paint with canvas 
or other textured materials, as textured dents in the background progressively "fill up" with "paint". 

The canvas blend command composites the run length number of pixels of a colour smoothly changing 
from the start colour to the end colour with the image, starting at the page address. The intensity of compositing 
is controlled by the difference between the blended transparency and the matte plane of the image. The matte 
plane is changed to be the difference between the matte value and the blend transparency. 

The operation is as follows for n=0 to Run/length-1 
Interpolation [n] = n/ <Run_length-l ) 
T[n] = Interpolation [n] *T (end) 

+ (1-lnterpolation [n] ) *T (start) 
RGB [n ] = Interpolation^] *RGB(end) 

+ (1-interpolation [n] ) *RGB (start) 
RGB (image) [address+n] 
= ( (T[n] .Matte (image) [address+n] ) *RGB [n] 
+ (256- (T[n] .Matte (image) [address+n])) 
*RGB (image) [address+n] ) /256 
Matte (image) [address+n] 
= Matte (image) [address+n] -T[n] 

where . represents a subtraction operation with underflow limiting. 

The canvas blend command is used for compositing object based blended colour "brush strokes" onto a 
textured background, where it is desirable to simulate the cumulative interaction between paint and a textured 
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background. To achieve this effect, the page matte should contain an image which corresponds to the texture 
of the background. 

The CANVAS PIXELS function composites a run of pixels with the image, with compositing controlled bv 
the p.xel transparency and the matte plane. This command simulates the interaction of paint with canvas or 
other textured materials, as textured dents in the background progressively "fill up" with "paint" 

The canvas pixels command composites the run length number of pixels with the image, starting at the 
page address. The intensity of compositing is controlled by the difference between the pixel transparency and 
the matte plane of the image. The matte plane is changed to be the difference between the matte value and 
the pixel transparency. 

The operation is as follows for n=0 to Run_length-1 

RGB (colour corrected) [n] 

•i 

= Limit ( (RGB (contrast) *RGB (pixel) [nJ)/256+ RGB (bright) ) 

RGB (image) [address+n] 
= ( (T (pixel ) .Matte (image) [address+n]) 
*RGB (colour corrected) fn] 

+ (256-(T (pixel) .Matte (image) [address+n])) 
*RGB (image) [address+n] ) /256 
Matte (image) [address+n] 
= Matte (image) [address=nj . T (pixel) 

where . represents a subtraction operation with underflow limiting. 

The canvas pixels command is used for compositing pixel based "brush strokes" onto a textured back- 
ground, where .t is desirable to simulate cumulative interaction between paint and a textured background To 
ach.eve this effect, the page matte should contain an image which corresponds to the texture of the background 
Th.s command lean i alsc > be used for simulating screen printing of a natural image onto a textured background.* 
fr n h k £ TEXTURE RUN function composites a run of colours with the image, with compositing con- 
trolled by the texture and the matte plane. This command simulates the interaction of textured brush strokes 
wrth canvas or other textured materials, as textured dents in the background progressively "fill up" with "paint" 

The canvas run command composites the run length number of pixels of a colour equal to the start colour' 
starting at the page address. The intensity of compositing is controlled by the difference between the i subse- 
quent texture bytes and the matte plane of the image i The matte plane is changed to be the difference between 
the matte value and the texture bytes. 

The operation is as follows for n,0 to Run_length-1 
RGB (image) (address+n] 
= ( (Texture (n] .Matte (image) [address+n] ) *RGB (start) 
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+ (256- (Texture [n] .Matte (image) [address+n] ) ) 
*RGB (image) [address+nj ) /256 
Matte (image) [address+n] 
= Matte(image) [address+n ]. Texture [n ] 

10 where . represents a subtraction operation with underflow limiting. 

The canvas texture run command is used for compositing textured object based constant colour "brush 
strokes" onto a textured background, where it is desirable to simulate the cumulative interaction between paint 
and a textured background. To achieve this effect, the page matte should contain an image which corresponds 
to the texture of the background, and the subsequent texture bytes should contain a texture which relates to 
15 the texture of the paint medium. 

The CANVAS TEXTURE BLEND function composites a blend of colours with the image, with compositing 
controlled by the texture and the matte plane. This command simulates the interaction of textured brush strokes 
with canvas or other textured materials, as textured dents in the background progressively "fill up" with "paint". 
The canvas blend command composites the run length number of pixels of a colour smoothly changing 
20 from the start colour to the end colour with the image. The intensity of compositing is controlled by the difference 
between the subsequent texture bytes and the matte plane of the image. The transparency values of the start 
and. end colour are ignored. The matte plane is changed to be the difference between the matte value and the 
texture bytes. 
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The operation is as follows for n-O to 
Blend_length-1 

Interpolation [n J = n/ (Run_length-1) 

RGB [n ] = Interpolation [n] *RGB (end) 
+ (l-Interpolation[n] ) *RGB ( start) 

RGB (image) [address+n] 
= ( (Texture [n] . Matte (image) [address+n] ) *RGB [n] 
+ (256- (Texture [n] .Matte (image) [address+n])) 
*RGB < image) [address+n] ) /256 

Matte (image) [address+n] 
= Matte (image) [address+n] . Texture [n] 



where . represents a subtraction operation with underflow limiting. 

The canvas texture blend command is used for compositing textured object based constant colour "Brush 
so strokes" onto a textured background, where it is desirable to simulate the cumulative interaction between paint 
and a textured background. To achieve this effect, the page matte should contain an image which corresponds 
to the texture of the background, and the subsequent texture bytes should contain a texture which relates to 
the texture of the paint medium. 

It will be apparent to those skilled in the art that the above described functions result in the matte plane 
being changed by either the colour information (transparency) of the image or the texture information of the 
image. Subsequent manipulations are based on the changed matte plane. 

The foregoing describes only one embodiment of the present invention , and modifications, obvious to those 
skilled in the art can be made thereto without the party from the scope of the present invention. 
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For example many of the individual building blocks described include a latch or latches „h,«* 
buffer data out of that respective unit. Those latches are nnlv „c„h i„ ,k Ch are used to 

synchronisation throughout the various pipeline ^qe^hrouot th* t „• PreS6nt embodimen < <° enable 
capable of operating at much highe • wc^lta 9 " ? ""S'"* 10 " Wth hardware ««* is 

become insignificanfand the reg^n^^ « 
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1XXX 

0000 

0001 

0010 

0011 



0100 
0101 
0110 
0111 



:2fl Type 



Compos i t\ ng 
Synchroni sation 
Synchronisation 
Configuration 
Configuration 



Instrnrt^n 



Configuration 



Compos i ting 



Bit# 15 14 13 1 

Function 



COMPOSITE 

NOP 

WAIT 

LOAD ZOOM 

LOAD CONFIGURATION 

LOAD REGISTER 

spare 

spare 

EXECUTE UC0DE 



f iCldS Parampf - prtf 



Multiple Mu-ltlple 
HW/SH/PL 

Zoom ratios 
Skip mode, write 
enables, mux 
enables 
Register # Register value 



TABIF 7 



2 U '° 9 8 7 6 5 4 3 2 , o 
RE3 RF? RF1 RFO w fy? m FYn 

TABLE 3 



Bit * 15 14 13 12 11 

faction 



10 9 8 7 6 5 4 3 2 1 0 
- SBR SBH SFR SEH M?ft M ? r H ?B HFR Mrn urR u ^ 

TABLE 4 



r 8it# 31 30 29 26^5lT7 3 22 21 20 19 18 17 16 

Function n i T , ^ u Iy 18 17 16 
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TABLE 5 



Red ff 


NAME 


DESCRIPTION 


0 


ST_RE0 


Start value of red component for run or-'blend. Also used 






as red Brightness. 


1 


ST_GREEN 


Start value of green component for run or blend. Also 






used as green Brightness. 


2 


ST_BLUE 


Start value of blue component for run or blend. Also 






used as blue Brightness. 


3 


ST_TRANS 


Start value of transparency component for run or. blend. 


4 


ND_RE0 


End value of red component for run or blend. Also used 






as red Contrast. 


5 


ND_GREEN 


End value of green component for run or blend. Also used 






as green Contrast . 


6 


ND_BLUE 


End value of blue component for run or blend. Also used 






as blue Contrast. 


7 


ND_TRANS 


End value of transparency component for run or blend. 


8 


P_START_H 


Horizontal Page Start address. 


9 


P_START_V 


Vertical Page Start address. 


10 


S_START_H 


Horizontal Screen Start address. 


11 


S_START_V 


Vertical Screen Start address. 


12 


UCODE 


Microcode register. 


13 


ALT_RL 


Alternate run length register. 


14 


T_OFF 


texture offset register. 


TABLE 6 


Bit n 


31 30 


29 28 27 26 2S 24 23 22 21 20 19 18 17 16 


Function 0 0 


0 1 HWR SWR PI R 



TABLE 7 



ADDRESS 


MODE 


NAME 


SIZE 


DESCRIPTION 


00 


WRITE 


COMMAND 


32 


Command Input 


01 


WRITE 


CONTROL 


16 


general control 




REA0 


STATUS 


1$ 


general status 
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tabi c a 



61 1 # Ti^n 9c 




23 22 . . 17 16 


15 14 


9 


8 7 


6 




1 0 


Instruction COM 


M A 


N 0 


T.O. 




R. 


L 






Operands 7 R 


0 


7 G 0 


7 R 




0 7 




T 








TABLE 9 












0 


Bit # 15 14 13 12 


1 1 


10 9 8 7 


6 5 


4 


3 


2 


1 


0 


Function HIF STF FJF C\v^ 


CSI 


CEI RST swr <;ff 


OF TMS 


TMd 


TM1 


TM? 


TLM 




— — • — ^lj-j — -j ■ v ■ ■ is. nit inii 

TABLE 10 


Bit* 15 14 13 12 


11 


10 9 8 7 


6 5 


4 


3 


2 


1 


0 


Function HIE SIE FIF H r F 


SIF 


FIF FFF GR DOE 


TS3 TS? 


TS1 


T50 


V? 


VI 


vo 
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TABLE 11 



T YPe 


name 


it inDUts 


^outputs 


Ml I_ t -J I _ ^ 


Oescrl Dtlon 


1960 


HCLK 


1 




■ 


1960 Processor clock. 33MHz 




RESET- 


1 






Power on reset 




HDC31..0) 






32 


Host data bus 




CE- 


1 






chip mable 




RS 


1 






register select 




READ- 


1 






read enable 




DACK- 


1 






DMA acknowledge 




NEXT- 


1 






next DMA word enable 




OREQ- 




1 




DMA request 




T UT 




1 

1 




Interrupt reauest 


Comp. 


CLK2 


1 






twice pixel clock, 27MHz 


Mem. 


PXCLK 


1 






pixel clock, 13.5MHz 




CD_0E- 


1 






compositing data bus output 












enabl e 




CD<31..0> 






32 


compositing data bus 




LADC12. .0) 




13 




compositing line address 




PAD<12..0> 




13 




compositing pixel address 




BUFWI- 


- 


1 


- 


buffer write Image 




our rtn— 








buffer wr i te matte 




8UFR- 








buffer read 


pan/ 


GFX_ST- 








start of graphics run 


zoom 


VERT- 









screen vertical run 




LD_Z00M- 


_ 


1 


_ 


load zoom ratios 




SC^WE-(3..0) 


_ 






screen byte lane write enables 




SC_REA0 




} 




screen read 




SC_HRITE 





! 




screen write 




SC_RDACK- 


1 






screen read data acknowledoe 




SC^WDAOC- 


1 






screen write data acknowledge 


other 


TESTIN 


1 






test mode Input 




TESTOUT 




1 




test mode output 




ERR- 




1 




error indicator 




GFX RUN- 




1 




graphics engine running 












indicator 




RESTART- 


1 






restart command from VINEGAR 




TEST(2:0> 


3 






test inputs 




tfstt 




1 




test outout 
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Claims 



10 



15 

2. 

20 

3. 



25 



6. 

35 

7. 

40 



A graphics engine to manipulate colour image data on a pixel-by-Dixel ba^ «;h «n„;n^ ^ 
Parana, data path for each one of three prima'ry colours of said Se da ^each safd pL.h * 
cascade connected interpolator and compositor and said data paths being connected .n paraZ S ' 
cascade connected transparency interpolator and matte combiner wherein eacTof sal cTlourtL 
po ators receive coiour variation commands, each of said compositors receive ^fhX£ toZS,* 
sa,d pixel ,n turn and said compositors transform said colour input data in accordant w" fa ore d!t 
rnmed function the output of which is dependent upon both the input co\ou"teti TaZT^Z> 
ation command, and wherein said transparency interpolator receles^ Zpa^ency va^ 
and sa,d matte combiner receives matte plane input data for each said pixefand fsaid combL^ ™,f ' 
a colour blend value and an output matte value for each said pixel by trans ^^t^^TT* 
;n accordance with said transparency variation commands, the output ^^M^X^TrT 1 
.mage data simultaneously manipulated ascolour. transparency and matte 9 9 '° Ur 

A graphics engine as claimed in claim 1 wherein said colour variation commands are selected from th» 
group consisting of blends and colour correction. e se,ected frar " the 

A graphics engine as claimed in claim 1 or 2 . wherein said colour variation commands include a start va.u. 
^J^ElT^ ^ inPUtd3ta fe era specinc duration dSSS 

A graphics engine as claimed in any one of claims 1 to 3 wherein trancr,^. 

are selected from the group consisting of transparency b^.^ ^T^"^ Vanat, ° n """"^ 

A graphics engine as claimed in any one of claims 1 to 4 wfwpin tho trfl „ eM «„ 

cl, 1 Th J , y Sa ' d COm P° sitors to form «W h»ge Part which is simultaneously tex ured and 
"aL h :mTge e £ ne ^ ^ -» - 

A colour image data manipuiation system including a graphics engine as claimed in any one of claims 1 



45 



SO 9. 



10. 

55 



8. A ramp generator comprising: 

means for storing a number (N) as a plurality of bits of informaUon- 
m ined:m^m~ a nd r * beI *" M ^ermined "'nimu m and a predete, 

means for testing the most significant of said bits to establish either a steo mode or * i„mn m ^ 
of operation, whereby in said step mode, said output increases ma ^J£^fi£ft?£^ 

fano^nd ".IT' t S . C ' aimed in Claim 8 wherein said Predetermined minimum is 0% of the interpolation 
range and sa,d predetermined maximum is 100% of the interpolation range. interpolation 

Tnrctior rat ° r 38 C ' aimed C ' aim 8 " 9 WherSin S3id " Umber * <"* ™ length of a graphics 



11. 



A ramp generator as claimed in any one of daims 8 to 10 wherein said number includes at least 8 bite 



mode if said number is less than or equal to 255 

23 



BNSDOCID: <EP 0465250A2_t_> 



EP 0 465 250 A2 



12. A ramp generator as claimed in any one of claims 8 to 11 wherein said accumulator comprises a first mod- 
ule for calculating 255/(N-1), a second module for calculating (51 1-N) and a third module for calculating 
(512-2N), there being a first multiplexer connected to a zero vaue and an output of said second module, 
a second multiplexer connected to a constant (510) and the output of said third module and a third mul- 
tiplexer having inputs connected to the output of said first module and the output of said second multiplexer, 
and adder having one input supplied by the output of said third multiplexer and outputting to a fourth mul- 
tiplexer having its other input connected to the output of said first multiplexer, and an accumulating flip 
flop connected to the output of said fourth multiplexer whose output supplied an output of said ramp 
generator and a second input to said adder. 

13. A ramp generator as claimed in claim 12, and comprising a counter connected to the output of said 
accumulating flip-flop and having its count value incremented thereby, and a fifth multiplexer having one 
input connected to the count value output of said counter and the other input connected to said output of 
said fourth multiplexer, the output of said fifth multiplexer being switchable between said count value to 
provide a step mode ramp output and said fourth multiplexer output to provide an alternate, jump mode 
output. 

14. A graphics engine as claimed in claim 3 or claim 5 including the ramp generator as claimed in any one of 
claims 8 to 13 and wherein said specific duration determines the ramp output of the ramp generator. 

15. A desktop publishing system comprising a computer (1), a colour output device (3,7) and a rendering pro- 
cessor (10) arranged to perform rendering operations (e.g. colour fills, mattes, or mixing two images) to 
generate image data supplied to said output device. 

16. A system according to claim 15 which includes a workscreen memory (30) coupled to an image display 
apparatus (3) and a page memory (4) coupled to an image printer (7), said rendering processor (1 0) being 
arranged to access said memories (30, 4). 

17. Apparatus for processing image data to provide a colour image output, which comprises means for operat- 
ing upon red, green, blue, and matte or transparency information, wherein separate means are provided 
for each of the colour information and the matte information is processed by one of said colour processing 
means. 

18. A graphics engine including a grading circuit arranged to interpolate across a run of pixels from a first value 
to a second value, said grading circuit including a digital counting circuit arranged to produce a ramp of 
successive values for use on successive pixels. 

19. Image processing apparatus comprising a rendering processor (10) for performing colour and matte oper- 
ations on image data and, coupled thereto, a scaling processor (9) arranged to accept image data and to 
produce output image data of a different resolution corresponding thereto so as to expand or shrink por- 
tions of the image. 
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